Mixture model of Gaussian copulas to cluster mixed-type data

نویسنده

  • Matthieu Marbac
چکیده

A mixture model of Gaussian copulas is proposed to cluster mixed data. This approach allows to straightforwardly define simple multivariate intra-class dependency models while preserving classical distributions for the one-dimensional margins of each component in order to facilitate the model interpretation. Moreover, the intra-class dependencies are taken into account by the Gaussian copulas which provide one robust correlation coefficient per couple of variables and per class. This model generalizes different existing models defined for homogeneous or mixed variables. The Bayesian inference is performed via a Metropolis-within-Gibbs sampler. The model is illustrated by a real data set clustering.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model-based clustering of Gaussian copulas for mixed data

Clustering task of mixed data is a challenging problem. In a probabilistic framework, the main difficulty is due to a shortage of conventional distributions for such data. In this paper, we propose to achieve the mixed data clustering with a Gaussian copula mixture model, since copulas, and in particular the Gaussian ones, are powerful tools for easily modelling the distribution of multivariate...

متن کامل

Dependency Clustering of Mixed Data with Gaussian Mixture Copulas

Heterogeneous data with complex feature dependencies is common in real-world applications. Clustering algorithms for mixed – continuous and discrete valued – features often do not adequately model dependencies and are limited to modeling meta–Gaussian distributions. Copulas, that provide a modular parameterization of joint distributions, can model a variety of dependencies but their use with di...

متن کامل

Mixture of D-vine copulas for modeling dependence

The identification of an appropriate multivariate copula for capturing the dependence structure in multivariate data is not straightforward. The reason is because standard multivariate copulas (such as the multivariate Gaussian, Student-t, and exchangeable Archimedean copulas) lack flexibility to model dependence and have other limitations, such as parameter restrictions. To overcome these prob...

متن کامل

IMAGE SEGMENTATION USING GAUSSIAN MIXTURE MODEL

  Stochastic models such as mixture models, graphical models, Markov random fields and hidden Markov models have key role in probabilistic data analysis. In this paper, we have learned Gaussian mixture model to the pixels of an image. The parameters of the model have estimated by EM-algorithm.   In addition pixel labeling corresponded to each pixel of true image is made by Bayes rule. In fact, ...

متن کامل

Basket Default Swaps, CDO’s and Factor Copulas

We consider a factor approach to the pricing of basket credit derivatives and synthetic CDO tranches. Our purpose is to deal in a convenient way with dependent defaults for a large number of names. We provide semi-explicit expressions of the stochastic intensities of default times and pricing formulae for basket default swaps and CDO tranches. Two cases are studied in detail: mean-variance mixt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014